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On the Analytic Wavelet Transform 

Jonathan M. Lilly, and Sofia C. Olhede, 



Abstract 

An exact and general expression for the analytic wavelet transfoiTn of a real-valued signal is constructed, resolving the 
time-dependent effects of non-negligible amplitude and frequency modulation. The analytic signal is first locally represented as a 
modulated oscillation, demodulated by its own instantaneous frequency, and then Taylor-expanded at each point in time. The terms 
in this expansion, called the instantaneous modulation functions, are time-varying functions which quantify, at increasingly higher 
' orders, the local departures of the signal from a uniform sinusoidal oscillation. Closed-form expressions for these functions are 

^ , found in terms of Bell polynomials and derivatives of the signal's instantaneous frequency and bandwidth. The analytic wavelet 

^— ■») ■ transform is shown to depend upon the interaction between the signal's instantaneous modulation functions and frequency-domain 

' derivatives of the wavelet, inducing an hierarchy of departures of the transform away from a perfect representation of the signal. 

The form of these deviation terms suggests a set of conditions for matching the wavelet properties to suit the variability of the 
, signal, in which case our expressions simplify considerably. One may then quantify the time-varying bias associated with signal 

■ estimation via wavelet ridge analysis, and choose wavelets to minimize this bias. 

QQ , Index Terms 



(N 



in 



oo 



Complex wavelet, Hilbert transform, wavelet ridge analysis, amplitude and frequency modulated signals. 

I. Introduction 



This paper derives properties of the Analytic Wavelet Transform (AWT), a special family of complex-valued wavelet 
transforms, for the analysis of modulated oscillations. The complex-valued wavelet transform has emerged as an important 
non-stationary signal processing tool; a discussion of its properties together with references may be found in [1]. Continuous 
^ complex wavelets have been used for the characterization of modulated oscillatory signals [2]-[6] and discontinuities [7]. The 
~^ applications of complex wavelets to analysis of real signals includes mechanical vibratory signals [8], [9], seismic signals [10], 
position time series from drifting oceanic buoys [11], and quadrature Doppler signals in blood flow [12]. Much attention has 
^ also focused on the design of discrete wavelet filters that approximate the effect of an analytic continuous wavelet transform, 
with important contributions by Kingsbury [13], Selesnick [14], and others [15], [16]. In addition, the signal form we discuss 
CO in this article has been used to model speech [17] and echolocation of bats [18] as well as gravitational waves [19]. The results 
regarding the properties of the AWT derived in this article will thus be relevant to the analysis of signals from a number of 
fields. 

\ A broad class of interesting signals may be modeled as modulated oscillations, with the analytic signal as the foundation 

1 I [20], [21]. One wishes to recover the properties of the signal, without specifying a parametric model for its structure, based on 

a generally noisy or contaminated observation. Of particular interest are the time-varying amplitude and phase of the analytic 
• • signal as well as their derivatives. For contaminated signals the direct construction of the analytic signal via the Hilbert transform 
. !^ ' leads to disastrous results as the amplitude and phase will then reflect the aggregate properties of the multi-component signal 
[22]. It is necessary to isolate the signal of interest while simultaneously rendering it analytic. The AWT is a recipe for 
H constructing a family of versions of a time series which are both localized and analytic. 

. . A second analysis step, termed wavelet ridge analysis [2], [3], then identifies a special set of points from which the properties 
of an underlying analytic signal can be accurately estimated. This method can exhibit excellent performance, principally on 
account of its insensitivity to signal contamination due to the time/frequency localization of the wavelets. But the price of 
the localization is that the analytic signal is no longer precisely recovered in the absence of contamination, unlike direct 
construction of the analytic signal — i.e., bias is introduced in the estimation procedure. The departure of the estimated analytic 
signal from the true analytic signal is negligible if the signal modulation over the time support of the wavelet is also negligible 
[3], a strong constraint since many real-world signals exhibit substantial modulation. 

The purpose of this paper is to determine the exact properties of the AWT, and the resulting ridge-based signal estimation, 
for local analysis of oscillations with non-negligible modulation. Although Mallat [3] derives error bounds for analytic signal 
estimation in the weakly modulated case, time-dependent errors due to moderate or strong modulation have not yet been 
considered. Understanding this modulation-induced bias is important in order to correctly interpret the amplitude and frequency 
estimates provided by the wavelet ridge analysis. These results could also be used as a guide in choosing wavelets which 
explicitly minimize bias effects. 
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The structure of the paper is as follows. Necessary background is presented in Section [III Section |lll] then introduces a novel 
representation of a modulated oscillatory signal as a series of departures from a pure sinusoidal oscillation. In Section HV] a 
general expression for the AWT of a modulated oscillation is then derived. This result is used in Section [V] to examine the 
deterministic bias properties of ridge-based signal estimation. The paper concludes with a discussion. 

All numerical code associated with this paper is made freely available for use by others, as noted in Appendix U 



II. Background 

This section reviews the specification of the amplitude and phase of a modulated oscillatory signal via the analytic signal, 
and their estimation by the wavelet ridge method [2], [3] using a general family of analytic wavelets [23], [24]. 

A. Modulated Oscillations 

A real-valued amphtude- and frequency-modulated signal may be usefully represented as [25] 

xit) = a+(t) cos (1) 

with the amplitude a^{t) and phase cj)+{t) defined in terms of the analytic signal [20]. The analytic signal is specified in the 
frequency domain by 

1 

x+{t) = — / X+{Lj)e'^' du; (2) 

where we have introduced 

X+{lu) = 2U{lo)X{uj) (3) 

with U (uj) being the Heaviside unit step function. 

The construction of the analytic signal a;+ (t) permits the amplitude a+ (t) and phase </>+ (t) to be uniquely definecQ via 

a+(i)e*^+(*) =a;+(i) (4) 

and the original signal is recovered by x{t) = While more than one amplitude and phase pair may yield the same 

real-valued signal in ([T]), the symbols a+{t) and (j>+{t) denote the so-called canonical amplitude and canonical phase [26] 
associated with the analytic signal. The conditions under which a given amplitude and phase can be recovered in this manner 
have been examined by [27]. The rates of change of the amplitude and phase are quantified by 

. »{| ,.,.,(,)} ^^±11 ,5, 

u(t) s aj^ lni+(t)| =*V(t) (6) 

which are referred to as the instantaneous bandwidth [28] and instantaneous frequency [25], [29], [30], respectively. These two 
fundamental instantaneous quantities have an intimate connection to the first two frequency-domain moments of the signal's 
spectrum; see e.g. [31] and references therein. 

It frequently arises that one wishes to estimate the instantaneous properties — a+{t), 4>+{t), v{t) and uj{t) — of a modulated 
oscillatory signal x{t) believed to be present in a noisy observation. Typically one is presented with an observed time series 
a;'-°^[i„] at discrete times <„ e ti, t2, ■ ■ ■ ,tN 

x^°^[tn]^xitn)+x<^'^[t„] (7) 

where x{tn) is a discretely sampled modulated oscillation and x*^'^ is a discrete noise process. Given the noisy observed 
SIS one wishes to estimate the properties of the modulated oscillation x{t). A powerful method for accomplishing 

this task is wavelet ridge analysis, described subsequently. This method extracts an estimate of a modulated oscillation from 
the analytic wavelet transform. In the estimation procedure, there are three sources of error: 

(i) Errors associated with the discrete sampling; 

(ii) Random errors due to the noise x'*''[i„]; and 

(iii) Errors dependent upon the oscillation x{t) itself. 

The purpose of this paper is to examine this third type of error, which may be called "bias". Henceforth we assume that the 
noise process x^'^^ [tn] vanishes and that the sampling is perfect, and consequently we work in continuous time. 

'Note that at isolated points where ax{t) = 0, the value of the phase is typically defined by continuity [26]. 
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B. The Analytic Wavelet Transform 

A wavelet ipit) S is an analyzing function used to localize a signal simultaneously in time and frequency. By 

definition, a wavelet has zero mean and finite energy, and additionally satisfies the "admissibility condition" [32] 



dw < 



where is the Fourier transform of the wavelet. The wavelet transform of a signal x(t) G 

onto rescaled and translated versions of 



■r 



x{t) dr 



(8) 



is a series of projections 



(9) 



which are indexed by both the time parameter t and a scale parameter s; the asterisk denotes the complex conjugate. Note the 
choice of a 1/s normalization rather than the more common as we find the former to be more convenient for oscillatory 

signals. 

Here we will consider only wavelets which vanish for negative frequencies, i.e. which have '^{uj) — for cj < 0. Such 
wavelets are called analytic^ and (|9]l then defines the analytic wavelet transform (AWT). Alternatively the AWT may be 
represented in the frequency domain as 



27r Jo 



(10) 



with the integration requiring only positive frequencies on account of the exact analyticity of the wavelet. The analytic wavelet 
'^{llj) has a maximum amplitude in the frequency domain at uj — uj^, which is called the "peak frequency". We choose to set 
the value of the wavelet at the peak frequency to ^(cj^,) = 2, since then with x{t) = aoCos{uJot) we have the convenient 

result \W^ {t,uj^,/ujo)\ = \ao\- 

A useful way to categorize wavelet behavior is through normalized versions of the derivatives of the frequency-domain 
wavelet. We define the wavelet's dimensionless derivatives as 



(11) 



where the superscript "(n)" denotes the nth derivative. We will consider only wavelets for which the second derivative at the 
peak frequency ^'"((jj^,) is real-valued, in which case it is also negative since ^'(cj^) is a maximum by definition. Then 



P,/, 




(12) 



defines a real-valued quantity which is a nondimensional measure of the wavelet duration. 

/tt may be shown to be the number of oscillations at the peak frequency uj^p which fit within the central time window of 
the wavelet [24], as measured by the standard deviation of the demodulated wavelet tp{t)e~'^'^'*'* . Approximating the wavelet 
by its second-order Taylor expansion about uj^ gives 



p2 



(13) 



and so we have '3/ (a;^(l ± l/P^f,)) /^'(a;^) w 1/2. Thus the inverse duration can also be seen as a nondimensional 

measure of the wavelet bandwidth. After this second-order description of the wavelet, (cj^,) offers the next-higher-order 
description, and can be interpreted as quantifying the asymmetry of the wavelet about its peak frequency [24]. 



C. A General Family of Analytic Wavelets 

To examine the role of the analyzing wavelet in shaping the performance of wavelet ridge analysis, we will need a general 
family of analytic wavelets whose properties may readily calculated. The generalized Morse wavelets [23], [24], [34] are such 
a family. These wavelets are defined in the frequency domain by 

^ = U{Lu)ap,^LO^e-^" (14) 

where ,y is a normalizing constant and U{llj) is again the unit step function. The generalized Morse wavelets are controlled 
by two parameters, /3 and 7, the roles of which in shaping wavelet properties were examined in detail by [24]. To be a valid 
wavelet one must have /3 > and 7 > 0. By varying these two parameters, the generalized Morse wavelets can be given a 

^This terminology reflects the fact that, if ^(tj) has no support on negative frequencies, 'tl>{z) will be an analytic function of a complex argument z; see 
the discussion in Appendix 1 of [33]. 
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Examples of Generalized Morse Wavelets 




Frequency / Peak Frequency 



Fig. 1. Examples of the generalized Morse wavelets. Panels (a-d) and (f-i) show wavelets in the time domain for different (/3, 7) values, which are indicated 
in the lower left corner of each panel; for presentation, the wavelets are rescaled by their maximum amplitude. The real part is a solid line, the imaginary part 
is dashed, and the modulus is a thick sohd line. The frequency-domain versions of the wavelets in the top and bottom rows are then given in panels (e) and 
(j) respectively. Three wavelet which will be used in the future, (a-c), are set off with a box. The upper row of wavelets shows the effect of increasing /3 with 
7 fixed at 7 = 3, hence 'i'3-i3,-/{i^i3,-y) vanishing; here Pfj,-y = y^Wf takes on values of \/3, 3, 3\/3, and 27. The lower row shows the effect of increasing 
7 and decreasing (3 such that Pf}^^ remains fixed at Pf}^^ = 3; here ^^.^ .^{iLjp .y)/'if2;i3,-y{'^f3,-^) is equal to -2.7, -2.0, 3.0, and 14.9, respectively. 



broad range of characteristics while remaining exactly analytic. Note that we will replace the subscript "ip" with "/3, 7" to 
denote quantities pertaining to these wavelets. 

Simple expressions for important properties of the generalized Morse wavelets are given in [24]. The peak frequency is 
Ufs.-y = {Pl^)^^^ , and choosing = 2{e^/ (3)^/'^ , with e being Euler's number 2.7182 . . ., then gives ^ 13 ^^{u 13 — 2. The 
dimensionless duration Pp,^ of the generalized Morse wavelets is 

^^3,7 = \/-$2;/3,7(w/3,7) = (15) 

while the third-order dimensionless derivatives at the peak frequency is 

$3;/3,7('^/3,7) = -(7-3)F|,^. (16) 

A general expression for n;p ,-y{'-^ i3 ,1) of any order may be found in [24]. 

In fact, these wavelets form a very broad family that subsumes many other types of wavelets. It was shown by [24] that the 
generalized Morse wavelets encompass two other popular families of analytic wavelets: the Cauchy or Klauder wavelet family 
(7 = 1) and the analytic "Derivative of Gaussian" wavelets (7 = 2). The diversity of the generalized Morse wavelets is due 
to the fact that they are a function of two parameters, j3 and 7, hence their second-order and third-order properties Pp^ and 
^3;/3,7('j-'/3,'y) may be independently varied. 

Examples of the generalized Morse wavelets are shown in Fig. [T] The upper row shows the effect of increasing (3 with 
7 fixed at 7 = 3, hence '^j.-.p.-^iojp^-y) vanishes but Pp,^ = \fP^ increases from left to right. The wavelet becomes more 
oscillatory in the time domain, or more tightly peaked in the frequency domain. The lower row shows the effect of increasing 
7 and decreasing (3 such that Pp,^ remains fixed at Pp^^ — 3, but with '^7i;i3rii^l3,i) increasing from negative values on the 
left to positive values on the right. With Pp^-y fixed, the number of oscillations within the central window does not change, but 
the long-time behavior of the wavelet changes considerably; in the frequency domain, an enhancement to the right of the peak 
shifts to the left of the peak as 7 increases. This illustrates the effect of independently varying second-order and third-order 
wavelet properties. 
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D. Wavelet Ridge Analysis 

The idea of wavelet ridge analysis [2], [3] is that there exist special time/scale curves, called wavelet ridge curves or simply 
ridges, along which properties of a modulated oscillatory signal are accurately represented. Unlike the instantaneous frequency 
curve, which is not known, the ridge curves are based on properties of the transform itself and can be readily located. The 
AWT evaluated along the ridge constitutes an estimator for a presumed modulated oscillatory signal. 

A ridge curve is based on an aggregation of points known as ridge points. Two separate definitions of ridge points are in 
common use. 

Definition 2.1: Ridge Points 
An amplitude ridge point of {t, s) is a time/scale pair {t, s) satisfying the two conditions 

^3?{lnW^(t,s)} = (17) 

— ^{\nW^{t,s)} < 0. (18) 

Since ^{\nWt(, {t, s)} = ln\W^, {t, s)\, these conditions state that for each fixed time point t, an amplitude ridge point 
corresponds to the scale at which a local maximum in the transform magnitude occurs. 

Similarly a phase ridge point of W^p {t, s) is a time/scale pair {t, s) satisfying the two conditions 

^ 9{lnW^^(i,s)}-^^ = (19) 



d_ 



dt ^-■■^^-"'J s 
ot s 



< 0. (20) 



Condition ( fT9] l states that the rate of change of transform phase matches uj^ /s, which is interpretable as a frequency associated 
with scale s; see [24] for a discussion of this point. Note that condition ( |20] l. like ( fTSl l. has the effect of identifying amplitude 
maximum rather than minima. While ( |20l ) is not standard, we introduce it here in order that the phase ridges may be defined 
without reference to the transform amplitude. 

Ridge points are then grouped into sets called ridge curves. 

Definition 2.2: Ridge Curves 

Let the set of all amplitude ridge points of some real-valued signal x{t) with respect to a wavelet 'ip{t) be denoted while 
S^'P^ denotes the set of all phase ridge points. Henceforth we will use the notation such as S^'\ with a superscript "•" referring 
to either "a" or "p". Then a ridge curve s^'^{t) is a scale curve as a function of time which maps out a contiguous collection 
of individual ridge points. The ridge curve is defined over some time interval T^^ and is constrained to additionally satisfy 
the continuity condition 



dt 



< oo. (21) 



This latter condition excludes discontinuities in the scale curve s^'^{t) as a function of time, as well as multiple values of 
scale at a particular time. 

The union of all ridge points is also known as the wavelet skeleton of the signal [35, pl4-18]. An estimate of a modulated 
oscillatory signal may then be constructed by evaluating the wavelet transform along the ridge curve. Here we have assumed 
the presence of a single modulated oscillation; signals consisting of a superposition of such oscillations may be treated in a 
similar fashion provided the instantaneous frequency curves are sufficiently separated in time and/or frequency [5]. 

Definition 2.3: The Ridge-Based Signal Estimate 
The amplitude or phase ridge-based signal estimate is 

= w:^ (t,s{->(<)) teT{-> (22) 

which is the set of values the wavelet transform takes along the ridge curve. This simple form is due to the 1/s normalization 
and the choice ^{u}^) = 2 introduced in Section Hl-BI 

It was shown by [3] that the error in x^^^(t) becomes negligible when v{t), v'{t), and u>'{t), together with a fourth term 
involving broadband bias, all tend to zero. However, the time-varying form of the error for non-vanishing modulation strength, 
and the conditions governing an appropriate choice of wavelet for a given signal, have not yet been examined. 



E. Application to Oceanographic Data 

An example of wavelet ridge analysis is shown in Fig. |2] The data, the uppermost time series in Fig. |2^-c, is the eastward 
velocity recorded by a freely drifting subsurface oceanographic float [36]-[38]. Such instruments are an important means of 
tracking the ocean circulation, and this and other such data may be downloaded from the World Ocean Circulation Experiment 
Subsurface Float Data Assembly Center (WFDAC) at http : //wf dac . whoi . edu. The oscillatory nature of the signal 
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Ridge Analysis with 7^3, p=1.5 



Ridge Analysis with 7^3, (3=3 



Ridge Analysis with 1^3, (3=7 




-500 500 -500 
Day of Year 1986 



-500 500 -500 
Day of Year 1986 



-500 500 -500 
Day of Year 1986 



Fig. 2. Example of wavelet ridge analysis. The first row (a-c) shows the data together with the resulting ridge-based signal estimate and the residual; these 
are offset with the data (which is the same for all panels) at the top, the estimated signal in gray in the middle, and while the residual at the bottom. The 
thi'ee columns correspond to the use of three different wavelets, specifically those shown in Fig. [T^-c. The three wavelet transforms are shown in (d-f) with 
the ridge curves marked as heavy lines. The final two rows are used in a subsequent section. The third row (g-i) shows the instantaneous frequency (solid) 
and bandwidth (dashed) of the estimated signals; these quantities are estimated as described later in Section |V] The fourth row (j-1) shows the estimated 
real (solid) and imaginary (dashed) parts of the second instantaneous modulation function P2{t), along with the contribution (t) / u!-^ (t) due to the squared 
bandwidth (heavy curve). Dotted lines are plotted at it-|P^ ^, which as discussed later is a measure of the degree of variability appropriate for each wavelet. 



reflects the presence of an oceanic vortex [e.g, 39], properties of which may be inferred from the modulated osciflation as in 
[11]. This particular record has recently been used as an example in other studies [31], [40]. More details regarding the data 
and its interpretation may be found in [31], but here we shall merely take it as an typical example of a modulated oscillation 
in noise. 

Three wavelet transforms are shown in Fig. |2}l-f using the three wavelets shown in Fig. [T^-c, together with locations of 
amplitude ridges. The resulting signal estimates and the differences between the original time series and the signal estimates 
are presented in Fig. |2^-c. The data is actually recorded as position, and the wavelet transform applied to this position record, 
but for clarity the time derivatives are presented in Fig.|2^-c as these emphasize the oscillatory structure, rather than the lower- 
frequency meandering behavior which is also present. The wavelet transform is taken at 74 logarithmically-spaced frequencies 
between radian frequencies 0.12 and 2.39; the data sample interval is one day. All amplitude ridges are found whose length 
exceeds 2P^ — that is, 2\/4.5 = 4.2, 2%/9 = 6, and 2^/21 — 9.2 cycles for Fig. |2^, b, and c respectively. In all three cases 
there is only one such ridge, which extends nearly throughout the entire record. 

All three of the estimates appear reasonable, and they are not drastically different from one another However, the residual 
curves in Fig. |2^-c reveal some differences, with variability of the residual — and smoothness of the estimated signal — increasing 
as Pp^j increases. Also, a major low-frequency fluctuation near time t — appears to have been missed by the smoothest 
estimate in Fig. |2};. An important issue is the ability to compare these signal estimates against one another to decide which is 
to be preferred; this is accomplished in Section IV-DI using the results of the subsequent development. Discussion of the last 
two rows in Fig. |2] will be left until later 

F. Outstanding Questions 

This section has presented essential elements of wavelet ridge analysis for modulated oscillations, as introduced by [2] and 
extended by [3]. A number of questions may immediately be asked: 

(i) What is the form of the time-varying bias terms in the ridge-based signal estimate? 

(ii) Are the amplitude and phase ridges the same, and if not which shows superior performance? 

(iii) How should the wavelet properties be chosen in order to minimize bias? 
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Addressing these questions is the goal of this paper, which we accompUsh with special attention to the generalized Morse 
wavelets and using the data in Fig. |2] as an example. 

III. Representation of Modulated Oscillations 

In this section a local expansion of an analytic signal is constructed. The hierarchy of terms in this expansion quantify 
increasingly higher-order local deviations of the signal from a constant amplitude, constant frequency sinusoid. 

A. A Local Representation 

Definition 3.1: Instantaneous Modulation Functions 
We aim to express the local variation of an analytic signal as a series of departures from a uniform oscillation. To this end 
we define functions for n — 1,2, . . . as 

1 1 (V" r n 
p„(i) = — — —— x+{t + T)e-'^^'^^ (23) 

where t on the right-hand-side is interpreted as a reference or "global" time, while r is a "local" time. The Pn{t) are the 
T-derivatives, evaluated at r = 0, of x+ {t + r) demodulated by a uniform oscillation in local time r having a frequency equal 
to the signal's instantaneous frequency uj{t) at the global time t. Division by powers of renders the /0„(t) dimensionless. 
These functions will be called the instantaneous modulation functions, and are to play an important part in what follows. We 
are now in a position to state the following theorem. 

Theorem 1: The Local Modulation Expansion 
Let x+{t) be an analytic signal defined by (|2]l for a real-valued x{t) e I/^(R). Assume that x^{t) G (jN+i i.e. 
that x^{t) is iV + 1 times differentiable on the interval [t, i + r] for some "truncation level" N = 0,1,2..., and also that 
a;+ {t) ^ on that interval. Then x+ {t + r) may then be expressed as an TVth order Taylor expansion in local time r 

N 



x+{t + T)=x+{t)e"^^''>' 
where the form of the residual term 



1 + V 1 [c^(i)T]" p„(t) + Rn+i{t, t) 
=1 



(24) 



N+l 



RN+l{T,t) = ' ^^'^^y PN + lit') t'e[t,t + T] (25) 

is found by employing the Lagrange form of the remainder in Taylor's theorem [41, p880]. When x{t) is an oscillatory 
signal, this expansion of a demodulated version of x+ (t) can be expected to converge much more rapidly than a direct Taylor 
expansion of x+{t) itself. 

Proof: The local modulation expansion is the Taylor series expansion of the complex-valued function 

with respect to the variable r about the point t — 0. ■ 
In the vicinity of some global time t, (l24l l represents the variation of x+ {t + t) with respect to local time r as a series of 
departures from a pure complex oscillation at the fixed frequency U!{t). The nth-order instantaneous modulation function Pn{t) 
thus gives the contribution to the deviation of the signal from a pure sinusoid at nth order in the dimensionless local time uj{t)T. 
The instantaneous modulation functions are interpretable as fundamental quantities describing the deviation of the signal from 
a uniform oscillation. For a constant-amplitude, constant-frequency sinusoid x{t) ~ Oq cos(ijjoi), the instantaneous modulation 
functions of all orders are well-defined and vanish identically everywhere. Signals which are considered "oscillatory" in the 
vicinity of some time t should therefore have the instantaneous modulation functions p„(t) not being too large. 

B. The Instantaneous Modulation Functions 

It remains to find the form of instantaneous modulation functions. In the following, we find it convenient to group the 
instantaneous frequency and bandwidth into a single complex-valued quantity 

r]{t) = uj{t)-iv{t) = -i^\nx+{t) (26) 

which we term the complex instantaneous frequency . The definition (l26b emphasizes that u){t) and v{t) are related as the 
imaginary and real parts, respectively, of the time derivative of h\x+{t). Since u;(t) and v{t) often occur together as the 
complex-valued quantity ?7(t), this grouping will simplify subsequent expressions. 
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To find expressions for the instantaneous modulation functions, we first note that the bandwidth may be expressed as 

d 



v{t) 



dr 



In 



T=0 



while the complex instantaneous frequency 77 (i) has an nth derivative given by 

d" 



IT] 



(«-l) 



it) 



dr"" 



In 



n > 1. 



T = 



(27) 



(28) 



Now assuming In x+ (t) to be infinitely differentiable in the neighborhood of time t, (|24] | for infinite N may be rewritten as 

exp{ln[x+(t + T)e-'^"W^] ~ lna;+(t)} = 1 + ^ - [c^(i)r]" pn (0 (29) 



which becomes, upon Taylor-expanding the exponent of the left-hand side, 

exp < v{t)T + 



ra=2 ' J ri=l 



[t). 



(30) 



This expression indicates a relationship between the instantaneous modulation functions, appearing on the right-hand side, and 
the instantaneous bandwidth and derivatives of the complex instantaneous frequency on the left-hand side. 

C. Expressions Using Bell Polynomials 

To derive closed-form expressions for the instantaneous modulation functions, we turn to a special set of functions called the 
complete Bell polynomials [42], [43]. The complete Bell polynomial i?„, operating on n arguments ci, C2 . . . , c„, is defined 
to give the coefficients appearing in the expansion 



f °° 1 1 °° 1 



1 ^ri) 



(31) 



with Bq = 1. Expressions for the first four Bell polynomials 

Bi(ci) = ci 



cl 



C2 

3ciC2 



C3 

4ciC3 



C4 



(32) 
(33) 
(34) 
(35) 



B2{CI,C2) 
i?3(ci,C2,C3) = 
-84(01, C2, C3, C4) = Cl 

can be verified directly by expanding (|3TT i and equating powers of t between the left-hand and right-hand sides. More generally, 
the Bell polynomials satisfy a recursion relation [42] 

n-l 

^?n(ci,C2, . . . ,C„) = 



p=0 



p 



_p Bp{ci,C2, 



(36) 



for ri > 1 given any ci, C2, . . . , c„. 
Comparing of (|30] l with ( |3l1 l we find 



/9„(t) = B„ 



(37) 



as an expression for the Tith instantaneous modulation function in terms of the nth-order Bell polynomial operating on the 
bandwidth v{t) and the first n — 1 derivatives of 77 (i). From ( |321434] | one then obtains 



Mt) 



Uj{t) 

v\t) 
v^{t) 



Mt) iri'{t) 







(38) 
(39) 
(40) 



as the first three instantaneous modulation functions. 

The nth instantaneous modulation function Pn{t) thus combines powers of the bandwidth v{t) and powers of time derivatives 
of the complex instantaneous frequency 7]{t) into a measure of the nth order departure of the signal from a uniform oscillation. 
On account of the nondimensionalization by powers of ijj(t), we can interpret the rates of change of the amplitude and phase 
involved in pn{t) to be on time scales proportional to the local instantaneous period 2Ti/u){t). The first of these, p\{t), is 
simply a nondimensional form of the bandwidth v{t). In general Pn{t) is complex-valued for n > 1. 
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D. Examples 

As an example, consider the complex instantaneous frequency specified by 

-qit) = oj{t) - iv{t) = t^o [1 - re''^''] 

where and loi are real-valued constants, and r is a potentially complex-valued constant with \r\ 
nth-order derivative of the complex instantaneous frequency is 



77(")(t) 



,n+l 



it) 



LUl 



(41) 

< 1. The normalized 
(42) 



|c^(t)|"+' Ht)\ 

This quantity decreases with increasing n whenever \uji\, the frequency at which the complex instantaneous frequency oscillates, 
is smaller than the local instantaneous frequency \i^{t)\ itself. If however \lui\ exceeds then derivatives of the complex 

instantaneous frequency will grow with n, eventually becoming non-negligible no matter how small one takes |r|. Rapid 
fluctuations of the instantaneous frequency or bandwidth therefore cause the instantaneous modulation functions to fail to 
decay with increasing n. 

As another example we return to the data analyzed in Fig. |2] Panels (g-i) present the instantaneous frequency uj{t) and 
bandwidth v{t) associated with the three estimated modulated oscillations in (a-c), while the corresponding values of p2{t) and 

(t) are shown in (j-1). The instantaneous modulation functions pi{t) and p2{t) reveal the nature and magnitude of the first- 
and second-order modulation of the three estimated signals. The most dramatic change is the increasing degree of smoothness, 
corresponding to the use of longer-duration wavelets, as one proceeds from left to right. The degree of amplitude variability 
increases in the latter half of all records, where the frequency has also increased, but we note from (g-i) that the amplitude 
modulation rate is generally very small compared to the instantaneous frequency, i.e. v{t) << oj{t). Now, writing out 
one finds 

^, , ^ v'{t) , luj'it) 



hit) = pi{t) 



(43) 



which shows that p\{t) is a contributor to 'p2{t)- Since panels (g-i) show that the bandwidth is very small compared to 
the instantaneous frequency, it is not surprising to find in (j-1) that 'p2{t) contains a negligible contribution from = 
{t) / uj"^ {t) . Instead we find that the real and imaginary parts of 'p2{t) are roughly equal, implying comparable contributions 
from v'{t) and Lo'{t) in all three estimates. 

E. Signal Variability 

Using the instantaneous modulation functions we may now quantify the degree of departure of a signal from a uniform 
oscillation. 

Definition 3.2: Local Signal Stability Level 
Choose a truncation level Nt for the local modulation expansion which is fixed over some time interval T. Then the stability 
level is defined as the smallest positive constant which satisfies 



v{t) 



Uj{t) 
./n-l)(t) 



< 



< 



'Nt: 



V t G T 



V 



(44) 



(45) 



teT, 2<n<NT. 

It is clear from ( |38] - |40| ). together with the recursive form of the Bell polynomials ( |36] |. that these conditions imply 

Pn{t) = 0{6l^) teT l<n<NT (46) 

with powers of relative bandwidth contributing at the same order as derivatives of the complex instantaneous frequency. 
Furthermore, note that 



also imply 
1 d 



because, for example, 

1 d 



hit) = 



uj{t) dt 



1 d 



Pn{t)^nxO{6ll') teT l<n<NT 



v^{t) +ir]'{t) 



2v{t)v'{t) +iri"{t) 



(47) 



(48) 



Lu{t) dt'''"'-^' Lu{t)dt 

and similarly for higher-order n. 

The local stability level S^^ is determined by the variability of the signal, and may be different for different choices of 
truncation level Nt- Thus the local stability level (Jat^ is a single number describing the extent to which any square-integrable, 
Nt + 1 times differentiable real-valued signal x{t) departs from a uniform oscillation at up to and including A^th order. 
When J^Vt ^ 1 it will be possible to obtain a greatly simplified representation of the AWT. This is the key to obtaining direct 
closed-form expressions for the effect of signal modulation on the AWT and the ensuing ridge-based signal estimates, as we 
address in the next section. 
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F. Wavelet Suitability Criteria 

The local stability level can be used to match a wavelet to a signal in such a way that the wavelet ridge analysis will yield 
an accurate estimate of the signal. The rational behind the "wavelet suitability criteria", which we now define, will become 
more clear when we apply them to the analytic wavelet transform in the next section. These conditions constrain the choice 
of wavelet appropriate for a given oscillatory signal. 

Assumption 3.1: Wavelet Suitability Criteria 
Consider a signal characterized by local stability level Snt with truncation level Nt over some time interval T. Given these 
quantifications of the signal's variability, we match a wavelet to the signal as follows. We assume that the frequency-domain 
derivatives of the wavelet satisfy the following criteria at the peak frequency for n> 2 

< 1 feN (49) 
^,.-i)/2M^ < 1 ^GN. (50) 

When Snt is small, the quantities (l/7i!)5'„(tj^) are permitted to grow with increasing n since then the powers of S^^ become 
smaller with increasing n. The lowest-order suitability criteria, at rt = 2, implies that — |^2|^^^ < \/ 2 / 6nt ■ This means 
that the time-domain wavelet is constrained so that is not too long — i.e. does not contain too many oscillations — compared to 
the degree of time variability of the signal, or alternatively, that the frequency-domain wavelet is constrained so that it is not 
too localized about the peak frequency. 

Note that (l49]450l l place a tighter condition on odd moments than on even moments. This will simplify our analysis, and 
is reasonable because one expects the odd moments — which quantify the degree to which the wavelet departs from symmetry 
about its peak frequency — will be small for useful wavelet functions, as discussed earlier. Since ^i(u;0) vanishes by the 
definition of w^, the lowest-order odd derivative to which these conditions apply is the third-order quantity ^^{uj^). 

G. Suitability of the Generalized Morse Wavelets 

Next we find a range of parameter space for which the generalized Morse wavelets satisfy the wavelet suitability criteria. 
Let us choose the wavelet such that ,^ = y/2/5N^ for some local stability level Snt- Thus for a given (^jv^. this fixes a 
curve in (/3, 7) space along which the lowest-order (n — 2) suitability criterion is satisfied, and we must ask where along this 
curve the higher-order suitability criteria are also satisfied. In Fig. |3] we plot 

(P|V2)^("^^^/^ ^"=^'>^-^ e 7L 

for different values of the doublet (/3, 7), with /3 > 1 and 7 > 1 and for n > 2. If these two quantities are less than unity and 
also P/3_-y = 1/2 / bjq^ , then comparison with (|49]450l l shows that the wavelet suitability criteria are satisfied. 

A difference in behavior is seen for 1 < 7 < 6, and other values of 7. For 1 < 7 < 6 the normalized wavelet derivatives 
are always less than unity and decay rapidly with increasing n. For other values of 7 this is not the case, and we see that 
unity is occasionally exceeded at n = 3, 5, 7, 9. Also, outside the region 1 < 7 < 6 the rate at which the plotted terms decay 
with increasing n is noticeably slower Thus if we are presented with a signal characterized by a local stability level over 
some time interval T and for some truncation level Nt, we can choose a generalized Morse wavelet to satisfy the wavelet 
suitability criteria by setting Pp.-^ < \/2/5nt and choosing any (/3,7) pair with (3 > 1 and 1 < 7 < 6. An application to the 
data in Fig. |2]will be given later. 

IV. Analysis of Modulated Oscillations 

The goal of this section is the derivation of an expression for the analytic wavelet transform (AWT) of a potentially highly 
variable signal x{t) which makes exphcit the interaction between the analytic signal x^{t) and the wavelet. 

A. Additional wavelet properties 

Measures of the wavelet time-domain support and long-time decay will be needed. The energy fraction function 

gives the ratio of the wavelet energy in a time window of half-width L to the total energy. The energy fraction is inverted by 
the time support function L^{a) 

L^{a)=al^{a) (52) 
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Morse Wavelet Decay withp>l, l<'y< 6 
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Fig. 3. Decay of normalized wavelet frequency-domain derivatives, as discussed in the text. Each point shows the normahzed nth frequency-domain derivative 
for a wavelet i/'/3,7(t) wi'h integer /3 in the range 1-21, and integer 7 in the range 1-11. Values for 1 < 7 < 6 are marked by large black circles, with values 
for 7 > 6 being gray dots. The heavy gray lines show 7 = 3, beginning at n = 4 since the n = 3 terms vanish for these wavelets. 



where the exponent "—1" in this context denotes the inverse function. L^{a) associates a wavelet half-width with a given 
energy fraction, such that a is the fraction of the wavelet energy inside the time window \t\ < L^{a). The long-time decay 
of the wavelet is specified by 

m)/m\ - \tr^ (53) 

for some constants 7-^, > 0, which takes on the value rp^^ —13+1 for the generalized Morse wavelets [24]. 
B. Theorem 

Using the results of the preceding section, we may state the following theorem, which gives the exact form of the AWT of 
a potentially highly variable signal with a general analytic wavelet. 

Theorem 2: The AWT Representation Theorem 
Fix (i, s) e M X M+, choose a truncation level Nt S N such that the wavelet decay satisfies > Nt + 2, and also an energy 
fraction a specifying a time support L^(a). Assume that 

In [x+{t)] e C^^+i [t-sL^{a),t + sL^{a)] 

which implies that |a;+(/;)| ^ over the same interval. The AWT of the real-valued signal x{t) is then 

where £^_jv+i(t, s) is a transform residual given by ( llOll l in Appendix Ull 

Proof: The proof is provided in Appendix [III together with bounds on the transform residual. For an intuitive illustration 
of the basic idea, here we prove an idealized special case. In this paragraph we take ip{t) to be a filter that is exponentially 
decaying in time, which means it cannot be an analytic function; thus a term arising from non-analyticity of ip{t) emerges 
here but not in ( l54l i. The analytic signal is assumed everywhere infinitely differentiable and non-vanishing. After a change of 
variables, and noting x{t) — [x+{t) + x*^_{t)]/2, the wavelet transform (|9|l becomes 

W4t,s) = l- f r (-)[x+it + r)+x*^{t + T)] dr = t^^,,^(i,s) + W^,,;(t,s) (55) 



n=l 



n 



(54) 
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in which we have implicitly defined an analytic portion W^^x+ {t, s) and an anti-analytic portion W^,x^ {t, s). For the former, 
substituting the local modulation expansion ( |24] | gives 



-oo ^ s / n. 



After the change of variables r/s ~ u, and exchanging the order of summation and integration, we have 



vI^*(Mi)) + E 



[zsw(t)T]"^*(r) e'"'"(*)" dr 



(56) 



(57) 



where we have introduced canceling factors of i" and (—«)". Now, the nth dimensionless derivative ( fTTT i may be written as 

1 



(58) 



by differentiating the Fourier representation of ^'(a;). Substituting this into (ISTl l obtains the right-hand side of (|54] | with 
infinite A^. Thus essentially ( |54] | arises by noting that taking AWT of the Taylor-expanded, demodulated signal involves forming 
the time domain moments, hence the frequency domain derivatives, of the analyzing wavelet. The proof in AppendixHIl handles 
the truncation to a finite number of terms N in the summation, together with complications arising from the polynomial rate 
of time decay of the wavelet. ■ 



C. Comments and Interpretation 

At this stage we offer some comments on the importance and interpretation of the preceding theorem, which we view 
as a fundamental result. The AWT representation theorem ( l54l i shows that the AWT is generated by the interaction of the 
frequency-domain derivatives of the wavelet with certain time-varying signal quantities — the instantaneous modulation functions 
introduced in the previous section. Each higher-order frequency-domain derivative of the wavelet ^'„(a;) interacts with a higher- 
order measure Pn{t) of the variability of the signal. The roles of amplitude and frequency modulation in setting transform 
properties are explicitly included. An advantage of the AWT representation theorem is that it permits us to compare different 
analytic wavelets for a given signal by comparing their frequency-domain derivatives. 

In contrast to previous works [2], [3], which assume that the signal bandwidth is small and that the bandwidth and 
instantaneous frequency are essentially constant, ( |54] | resolves the hierarchy of nonlinear terms and is therefore useful for 
a much broader variety of local signal behavior It can be seen as a substantial generalization of the pioneering work of Delprat 
et al. [2] and Mallat [3]. In particular. Theorem 4.5 of Mallat [3] is roughly equivalent to (|54] | for the case N — 0, that is, with 
the error term including everything except for the leading term of unity. Mallat's derivation assumes a particular form for the 
wavelet — a real-valued envelope multiplied by a complex exponential — which cannot be strictly analytic, but which mimics 
the form of the popular Morlet wavelet [32]. The original proof by Delprat et al. [2] of a result related to Mallat's relied on a 
stationary phase approximation, and similarly required the assumption of negligible modulation for both the wavelet and the 
signal. 



D. Compression Along Instantaneous Frequency Curves 

Here we use the AWT representation theorem to examine the wavelet transform along the instantaneous frequency curve, a 
key theoretical quantity which controls the behavior of the ridge-based signal estimator This sets the stage for the application 
to wavelet ridge analysis in the next section. 

Definition 4.1: The Localized Analytic Signal 
Evaluating the AWT along the instantaneous frequency curve yields a fundamental object reflecting the joint properties of the 
signal and the wavelet, 

x^{t) = W^{t,uj^/uj{t)) (59) 



(60) 

(61) 
1 term in 



which we term the localized analytic signal. From the AWT representation theorem (|54] |. we find immediately 



x^{t) = x+{t) 



1 



N 

E 

n=2 



[recalling ^'(a;^) = 2] where the residual in this expression 

ei>,JV+i (0 = £ip,N+i{i-,^-ii>l^{t)) 

is the transform residual appearing in (|54] | evaluated along the instantaneous frequency curve. Note that no n 
appears in (|60] | due to the fact that '^\{lo^) = by definition. 
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The localized analytic signal (t) is a non-uniform and nonlinear filtering of the signal by the wavelet. It can be seen as 
a sequence of local projections of the signal onto a set of analyzing functions, in which the analyzing functions — the rescaled 
wavelets tp{t/s)/s — are scaled to be proportional to the local instantaneous period. The localized analytic signal represents a 
non-uniform filtering because the analyzing wavelet changes in scale across time, and this filtering is nonlinear since the scale 
of the wavelet depends upon the instantaneous frequency of the signal being analyzed. It is clear that the localized analytic 
signal reduces to a linear time-invariant filtering when the instantaneous frequency is constant, since then it can be considered 
as merely the result of a convolution of the signal with some fixed wavelet function. 

In general, the localized analytic signal is not itself precisely analytic. Its analyticity is compromised on account of the 
localization. Taking the Fourier transform of (l60b . and noting that time-domain multiphcations become frequency-domain 
convolutions, we find that the Fourier transform of (t) is given by 

X^uj) = X+{lu) + V (~')"^»(^^) X /°° X+iu;')Pniij - io') dij' + S^^N+iico) (62) 

where P„(w) and £^ Ar-|_i(a;) are defined as the Fourier transforms of and of the product a;+(t)e^jv+i(t), respectively. 
If Pn{t) — for all n > 2, then X0(aj) has no support on negative frequencies; otherwise the convolutions may distribute 
energy to negative frequencies. In this manner the interaction between signal and wavelet can cause the localized analytic 
signal x^{t) to deviate from exact analyticity. 



E. Bias of the Localized Analytic Signal 

Making use of the local stability level, we may match the wavelet to the signal in such a way that the difference between 
x^{t) and x+{t) remains small. The localized analytic signal ( l60l l involves a hierarchy of interactions between the instantaneous 
modulation functions and the frequency-domain derivatives of the wavelet. In order that the localized analytic signal x^it) 
be close to the true analytic signal x+{t), these interaction terms must be kept small. At this point we invoke the wavelet 
suitability criteria (|49] l and dSOl l, introduced in Section ITlI-FI to obtain a simple expression for the bias of the localized analytic 
signal. 

Assume the signal is characterized by local stability level 5^^ for a truncation level Nt over a time interval T. Invoking 
the wavelet suitability criteria, the difference between the localized analytic signal and true analytic signal is for t e T 



Ax^{t) = ""'^^^^^(^"^^^^^ = -ln{^^)Mt)'^n{uj^)Mt) + l^m^Mt) + ^..^ w (63) 

[from (l54b l where we have set the truncation level set to Nt = 4. Thus with a suitable choice of wavelet, the deviation of the 
localized analytic signal consists of a series of terms representing increasingly higher-order interactions of the signal with the 
wavelet, which diminish with increasing order. 

Obtaining a small value of Aa;^ it) has two implications for the choice of wavelet. Firstly, as discussed in Section IIII-FI 
— i2'^2{^->p) — is a measure of the (squared) wavelet duration and should be chosen to be small in comparison with P2{t)- 
Note that this lowest-order contribution to lS.x^{t), at first order in Snt^ is associated with p2{t) rather than with this 
arises on account of the vanishing of '^i{lo) at the peak frequency lo^. Secondly, it is important to make an appropriate 
choice of wavelet with fixed so that the higher-order terms are small. For example. Fig. [T| shows that large absolute values 
of (w^) correspond to a high degree of frequency-domain asymmetry; thus the n — "i suitability criterion represents a 
bound on an acceptable degree of asymmetry. More generally, if the suitability criteria are satisfied then contributions from 
higher-order instantaneous modulation functions appear at higher orders than the leading term involving the duration P^, and 
may consequently be neglected when is small. 

It is instructive to examine the form of the amplitude and phase of the localized analytic signal if we keep only the 
lowest-order term in the expansion. With Nt = 2 we have 



x^{t) = x+{t) 



(64) 



recalling the definition ( fT2] i of P,p. The amplitude and phase of the localized analytic signal are implicitly defined via 

x^,{t) ^ a^,{t)e'^^^'^ (65) 

which, it should be pointed out, are not necessarily a canonical pair because x^{t) is not necessarily precisely analytic. We 
may introduce the deviations 

a^{t) = a+{t)[l + Aa^it)] (66) 

(jj^it) = 0+(i) + A0^(t) (67) 
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and inserting these expressions into f65[ . one obtains 

X4,{t) = x+{t) [1 + Aa^(i) +iA0^(i) + 0{A^i^(t)} +0{Aa^(OA<?!)^(t)}] . (68) 
Equating terms with ( |64] i then leads to 

A0^W = ^^0TW + OW,3W} + O(4J (70) 

for the ampHtude and phase deviation, respectively. 

In both cases the deviation is proportional to the second derivative, or curvature, of the quantity of interest. Since is 
a measure of the time duration of the wavelet, these deviations are small when the amplitude curvature and phase curvature, 
respectively, are small over the time support of the wavelet. Both amplitude and phase are underestimated during local maxima 
and overestimated during local minima. This is intuitive behavior since the localized analytic signal is essentially a smoothed 
version of the true analytic signal. 

In summary, to minimize the bias of the amplitude and phase of the localized analytic signal, one should make small 
relative to 'p2{t), while enforcing the wavelet suitability criteria. This raises the question of what is the smallest useful value 
for P^, an issue which will be addressed Section IV-DI 

V. Estimation of Modulated Oscillations 

Building on the results of the previous sections, we explicitly identify the hierarchy of time-dependent bias terms associated 
with estimation of analytic signal properties using the wavelet ridge method, and show how to choose wavelets which minimize 
these terms. 



A. Transform Near an Instantaneous Frequency Curve 

In the preceding section we defined the localized analytic signal, which is simply the set of values taken by the wavelet 
along the instantaneous frequency curve. In practice, however, this quantity is not known because the instantaneous frequency 
curve is not known. However the ridge curves defined in Section Hl-DI mav be identified directly from the transform, and these 
will be shown to closely approximate an instantaneous frequency curve. The ridge-based signal estimate is then found by 
evaluating the wavelet transform along a ridge: 



where the "•" is either an "a" or a "p" to refer to an amplitude ridge or a phase ridge, respectively, and where T^'^ is the 
time interval over which the ridge exists in practice. 

In order to obtain an expression for the bias of the ridge-based signal estimate, we must account for the deviation of the ridge 
s^'^ from the instantaneous frequency curve uj^/uj{t). To this end we employ another Taylor-series expansion and express the 
wavelet transform along the ridge in terms of the wavelet transform along the instantaneous frequency curve. There emerge 
powers of a quantity 

A^t,.)^f^-1 (71) 

which we refer to as the scale derivation since it gives the departure of a given scale from the instantaneous frequency curve. 
We then obtain the following result. 

Theorem 3: The AWT Scale Deviation Expansion 
With the same assumptions as the AWT representation theorem, and the additional assumption that ^'(a;) £ C°° (0, oo), the 
AWT representation theorem (l54l l can be cast in the form 



_m=0n=0p=0^ 

where now we define po{t) = 1 for convenience. This expansion involves a triple summation over orders of the wavelet 
derivatives evaluated at the fixed frequency u!^, orders of the instantaneous modulation functions, and powers of the scale 
deviation. 

Proof: The proof is given in Appendix |III] ■ 
The AWT scale deviation expansion relates the AWT of the signal along the instantaneous frequency curve to the AWT at all 
scales. It can be seen as expressing how the compression of the signal along the instantaneous frequency curve extends across 
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the time/scale plane. Using the local stability level, and the wavelet suitability criteria, we can simplify the scale deviation 
expansion ( |72] i in the vicinity of an instantaneous frequency curve. One final definition is also necessary. 

Definition 5.1: The Instantaneous Frequency Neighborhood 
The instantaneous frequency n-neighborhood is defined by 



n,,,4,{5N^) = {{t,s) : Auj{t,s) = 0(5]^^)} . 



(73) 



Thus, the n-neighborhood is a region of the time/scale plane within which the scale deviation is of nth order with respect 
to the local stability level Snt- As n increases, a region of the time/scale plane more tightly localized around uj^,/uj{t) is 
specified. This permits us to quantify the magnitude of the departure of a given scale point from the instantaneous frequency 
curve. We can now state the following theorem. 

Theorem 4: The AWT Ridge Representation Theorem 
Let x{t) e with t e T be characterized by stability level 5nt up to order Nt, and assume the wavelet is chosen such that the 
suitability criteria hold. Finally constrain the scale s such that {t,s) G 'R-2;ip{5nt)' implying the scale deviation is of second 
order in 6nt- We write the AWT of x{t) as 



{t,s)^ x+ {t)[l + Ax^, (t) + AW^ {t,s)] 



(74) 



which serves to define AW^(t, s). Thus W^{t,s) is separated into a scale-independent perturbation Ax^{t) and a scale- 
dependent perturbation AW^{t, s). The form of Ax^{t) was given earlier in ( l63T l. while by subtraction from ( f72] i we find 



0{5i 



0{Sl 



AW^{t,s) = -iAuj{t,s)pi{t)%{uj^)-Auit,s) 



3! 



o{sl 



+ i [Acoit, .3)f %{lu^) + O {S%^) + e^Ait, s) (75) 

for the form of the scale-dependent perturbation AWjf,{t, s). 

Proof: This theorem follows directly from the AWT scale deviation expansion ( |72] | truncated at = 3, together with 
the stated assumptions. ■ 

The AWT ridge representation theorem gives the form of the analytic wavelet transform in the vicinity of an instantaneous 
frequency curve, explicitly resolving the effects of modulation up to third order in Snt- An important point is that the scale- 
independent perturbation Ax^{t) contains the lowest-order term, at first order in (Jat^, while the scale-dependent perturbation 
AW^{t, s) contains only terms of second order or higher in 6nt- 

Note that the signal stability level (Jat^ has been used in several different ways. Its value is set from (I44lj45] l by the values 
of the derivatives of the original signal over some time interval T. These conditions involve up to the Nxth derivative of the 
signal, or the {Nt — l)th derivative of the complex instantaneous frequency; here Nt is a number that can be chosen, up 
to the degree of the signal's differentiability, and its choice will impact the value of 6nt that we find from the signal. The 
signal stability level 5^^. then constrains the choice of wavelet via the wavelet suitability criteria ( |49H50l l. Finally, the local 
stability level is also involved in the notion of the instantaneous frequency neighborhood (|73| l, which indicates a region of 
the time-scale plane within which a simplification of the AWT representation theorem may be found. The use of 6 Nt for the 
instantaneous frequency neighborhood has enabled an ordering of terms both on and off the instantaneous frequency curve 
using a single small parameter 



B. Expressions for the Ridge-Based Signal Estimates 

Using the AWT ridge representation theorem, we can now obtain closed-form expressions for the ridge curves and the 
associated estimate of the analytic signal. Henceforth, for simplicity, we assume that 5'„(aj^) is real-valued for n < 4, as is 
the case for the generalized Morse wavelet family of analytic wavelets [23], [24]. 

In Appendix [V] we find that both the amplitude ridge condition (fTTI l and the phase ridge condition ( fT9] l have unique solutions 
within the 2-neighborhood of the instantaneous frequency curve. The amplitude ridges are found to have the explicit form 



1 



v\t) v'{ty 




v{t) uj'{t) \ ^'4(cJv) 



1 v{t) Uj'{t) 



o 



(76) 
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while the phase ridges are given by 



Uj{t) 



1 L0"[t) V[t) Uj'it) 



O 



(77) 



where we have resolved terms up to second order in S^r^. Note truncation levels Nt = 4 and Nt = 3 have been used in the 
former and latter cases respectively. The forms of the residual terms 3^ (t) are given by (I143l l and (11501 ) of Appendix [V] 

These expressions have a rather complicated form, but there are two simple messages. Firstly, amplitude and phase ridges are 
definitely not the same, except perhaps for some special choices of wavelet properties. Secondly, from the wavelet suitability 
criteria it follows that all the resolved terms in ( |76] l and dTTl ) are of second order in Jat^; there are no terms at first order. 
Thus both types of ridge curves are of the form 



it) 



(78) 



where the ellipses indicate the omitted residual terms; here the "•" indicates either "a" or "p". This is important because 
the deviations of the ridge curves from the instantaneous frequency curve, and form each other, will be a higher-order effect 
compared to the first-order deviation of the localized analytic signal from the true analytic signal. 

An expression for the ridge-based signal estimate is found by substituting (iTSl l into (l75l l for /SW^{t,s). With a truncation 
level of Nt — 2, one finds 



x+{t) 



l + ^^^P2W+0(4.)+0{ey(<)} 



(79) 



which we note is identical, apart from the final residual term, to expression ( |64l ) for the localized analytic signal x^{t). Thus 
the leading order error term is due to the departure of the localized analytic signal — i.e. the AWT along the instantaneous 
frequency curve — from the true analytic signal, rather than the departure of the estimated instantaneous frequency curve from 
the true instantaneous frequency curve. We may alternately write ( |79] l as 



(80) 



which states that the ridge-based signal estimate accurately recovers the localized analytic signal. The difference between the 
two types of ridges will be of negligible importance when 5mt is small. 

To this theoretical result, we should add a caveat. While the perturbation analysis suggests there is no reason to prefer 
amplitude versus phase ridges, in practice we find the amplitude ridges to be superior. When both exist, we generally find they 
are indeed very close to one another, as expected by the perturbation analysis, but the phase ridges have a greater tendency to 
"break" at isolated points where modulation is particularly strong. In fact we find this to be the case when applying the phase 
ridge algorithm to the example in Fig. |2] with identical settings as for the amplitude ridges (not shown). Therefore based on 
experience we favor the amplitude ridges. 

We may similarly find the amplitude and phase estimates associated with the ridge-based signal estimate. Writing the 
estimated analytic signal in terms of an amplitude and phase 



we find, following the development in Section IIV-EI 



l + 0{d 
l + 0{6 



Nt) 



o 



o 



{4:^(t)} 
{43^(0} 



(81) 

(82) 
(83) 



so that the estimated amplitude and phase are the same as the amplitude and phase of the localized analytic signal up to second 
order in 6nt- 

It turns out that, had we derived the AWT ridge representation theorem with the more restrictive assumption that scale s lies 
within the 1 -neighborhood of an instantaneous frequency curve, we would have again found (iTSl i: this is why, in the preceding 
subsection, the ridge representation theorem was presented in a form appropriate for the larger 2-neighborhood. 



C. Instantaneous Frequency and Bandwidth Estimation 

Expressions for the estimated instantaneous frequency and bandwidth can also be found. A direct method of estimating the 
instantaneous frequency is simply through the scale frequency associated with the ridge curves, i.e. 

Q^-'^Ht) ^ [1 + 0(<5^J] . (84) 

However, the instantaneous frequency estimate formed in this way are not very satisfactory because they reflect the discrete 
scale levels s used in the numerical evaluation of the wavelet transform. Likewise, one could differentiate the amplitude and 
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phase of the estimated analytic signal but in our experience the discrete implementation of this differentiation tends to 

lead to rather noisy estimates. 

A better way to estimate both instantaneous frequency and bandwidth is to first form the quantities 



which we refer to as the transform instantaneous frequency and bandwidth. We then have the estimates 



(85) 
(86) 

(87) 
(88) 

which are obtained from the values of the instantaneous frequency and bandwidth along the ridge curve. In numeric implemen- 
tation, differentiation is thus performed prior to the lookup along ridges, rather than the reverse. The form of the instantaneous 
frequency estimate is found to be 



while that of the bandwidth is 

v^-^t) = v{t)+uj(t) X 



l uj"{t) v{t) u'jt) 



lv"it) v{t) v'{t) 



O 



0{5\ 



O 



(89) 



(90) 



as follow at once from ( |146l l of Appendix W\ The residual quantities £^,'3 (t) are given by jlSOt . The estimated instantaneous 
frequency and bandwidth plotted in Fig. E^-i have been constructed in this manner 

The leading-order perturbation term in the instantaneous frequency estimate (|89l l is at second order in 5nt^ but as v{t)/uo{t) 
is itself a first-order quantity, the estimated bandwidth is perturbed at first order in 5^^- The wavelet ridge estimation can 
therefore recover the instantaneous frequency with greater fidelity than it can the bandwidth. The instantaneous frequency 
estimate ( |89] | has the desirable property of being identical for both the amplitude ridge curves and phase ridges curves, 
unlike the direct estimates of instantaneous frequency (l84l i which differ at order Inserting ( ITTI i into ( |84| | shows that the 
instantaneous frequency estimate ( [89] l using either type of ridge curve is identical at leading order to that for direct estimate 
(|84] | of instantaneous frequency using the phase ridge curve. 

For reference, it is useful to compare with the rate of change of the amplitude and phase of the locahzed analytic signal 
x^{t). One finds 



uj^(t) = Q \ — \\i[x^{t)] \ = uj{t) 



1 



2 '^'tj3(t) 



0{6l^) + 0[e\;l{t,uj^lu{t))] 



for the rate of change of the phase and 

v4t) = ^[j^\n[x^{t)]\ =v{t)+Lo{t) X 



p2 



1 v"{t) v{t) v'{t) 



(91) 



(92) 



for the relative rate of change of amplitude. The form of the residual term is given in ( |137t . Comparison with ( [89l l and 
( |90l l shows that, while the instantaneous bandwidth estimate is identical to lowest perturbation order to the rate of change of 
amplitude of the localized analytic signal x^,{t), the same is not true for the instantaneous frequency estimate. 

The additional term in Lj^'^{t) compared with uj^{t) reflects the joint effect of contemporaneous amplitude and frequency 
modulation. It does not occur in (t) since 



^ QlnM/4i,.)|,^^^/^(,)^ (93) 



and evaluating the term proportional to aj'(t) from ( iTST i, we find it cancels a similar term in fl^ {t,uj^/uj{t)), leading to WH - 
The difference between uj^'^(t) and is therefore attributed to a contribution to the rate of change of phase (t) due 

to the motion of the instantaneous frequency curve across scales at a fixed time. The instantaneous frequency thus seems 
anomalous in that it is not completely controlled at lowest perturbation order by the localized analytic signal. 
Similarly we can estimate the second-order instantaneous modulation function p2 (t) by defining 



(94) 
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which is evaluated along a ridge, leading to 



(95) 




(96) 



where the leading-order term is again at order 0{6nt)^ ™d with the ellipses denoting a twice-differentiated residual term 
following the development for the once-differentiated residual term in Appendix |IV] In the example in Fig. |2] panels (j-1) 
show three versions of p^^ (t) estimated in this way using three different wavelets. 



The above development has shown that if the local stability level 6nt is known and is small compared to unity, one can 
choose a suitable wavelet such that the ridge-based estimates of the analytic signal its amplitude a+{t) and phase 4'+{t), 

its instantaneous frequency Lu{t) and bandwidth v{t), and its second-order instantaneous modulation function P2{t), are all 
close to their true values. A difficulty of course is that the local stability level ^^r^ is not generally known in practice. One can 
estimate Sn^^ but as was seen in Fig. |2] the estimated degree of smoothness of the signal depends upon the choice of wavelet. 

Nevertheless, by presuming that a certain estimated signal is in fact correct, one can gain insight into what estimated signals 
are reasonable. We apply this approach to the signal estimates shown in Fig. |2] Their are two considerations: whether the 
estimated signal is sufficiently smooth, i.e. is characterized by a sufficiently small 5nt^ secondly the wavelet is suitable 
for the stability level of the estimate it produces. For simplicity we take the time period T to be the entire record, and set the 
truncation level to = 2 so that we only need consider the second-order modulation function. 

In Fig. |2]-1, we have plotted p2{t) and drawn horizontal lines at ^/4. It is not the absolute degree of signal stability which 
matters for bias, however; rather, it is the value of |p2(0l relation to the wavelet duration P^.-y- Since Pp^-y = y^2/ Snt is 
the lowest-order stability condition, as discussed in (IIII-Gb . we should see |p2(i)| be bounded by these lines. One should keep 
in mind that the contamination of the signal by noise will tend to increase the roughness, so the estimated values of p2{t) 
are all anticipated to be somewhat too large. From inspection, we see that extension of p2{t) outside of these lines is minor 
for (i), occasional for (k), and — although it is difficult to see in this plot — extensive for (1). Since the great difference in the 
values of p2{t) and for the three estimated signals makes it difficult to compare them visually, we calculated some 

statistics to characterize their levels of variability. The mean values of the ratio \p2it)\/ (Pl^/'i) 

are 0.54, 0.74, and 2.10 for 

(j-1), respectively, while the corresponding median values are 0.47, 0.57, and 1.74. Since the suitability criteria require that 
this quanitity be smaller than unity, it appears that the wavelets used in the third column have a time duration that is too long, 
and this estimate would therefore be expected to be of a poor quality. 

The the level of variability in the third estimated signal clearly exceeds that expected from the suitability conditions. Let us 
say that the smoothest estimated signal, in Fig. |2};, is in fact the true analytic signal to be estimated. The extensive excursions 
of p2{t) outside the dotted lines in Fig. |2j means that the wavelet used in this column. Fig. [T];, is not suitable to analyze 
this signal. The ridge analysis using this wavelet has produced an estimated signal which it would not be able to recover 
accurately, an unacceptable result. On the other hand, the horizontal lines correspond to values of 6nt of 0.44, 0.22, and 0.01, 
respectively. Thus the estimated signal (j) is quite rough, and indeed appears obviously contaminated by noise. 

Assuming that the estimated signal is the true signal, we can iterate the estimation procedure and ask which of the iterated 
estimates shows the least error We solve for the median and mean values of — x+{t)]/x+{t)\'^, in which each of the 

three estimates signals in Fig. I3-1 plays the role of the true signal The mean values of the iterated deviations are 

0.040, 0.036, and 0.057, respectively, while the median values are 0.024, 0.014, and 0.022. This means that the wavelet used 
in the middle column is able to recover the estimated signal it produces with the greatest degree of fidelity. Thus while the 
true signal remains unknown, we can say from a quantitative analysis that the estimate in Fig. |2]<c is to be preferred. 

It was shown in Section IIII-GI that for 1 < 7 < 6, generalized Morse wavelet derivatives of all orders will satisfy the 
wavelet suitability criteria provided the lowest-order condition, Pp,^ < ^J2/5Nt, is also satisfied. This implies that a range of 
7 values could be chosen for a fixed Pp^^ and yield similar results. To check this, we compute the wavelet ridge estimates for 
the wavelets shown in Fig. [TJ and Fig. [T^, which like that in Fig. [TJ) have Pp^^ — 3, but with 7=1 and 7 = 6 respectively. 
The results (not shown) reveal both of the wavelet ridge estimates are very close to that for 7 = 3, as expected. This means 
that the error terms due to higher-order wavelet derivatives have been successfully contained to higher perturbation order by 
the wavelet suitability criteria. 

E. Implications for Choice of Wavelet 

In this section we show how the ideas developed in this paper guide the choice of wavelet appropriate to the analysis of a 
given signal, using the generalized Morse wavelets as the reference point [23], [24]. 

The higher-order wavelet properties will be addressed first. The wavelet suitability conditions imply 1 < 7 < 6 for the 
generalized Morse wavelets. The 7 = 3 wavelets are in a sense optimal for fixed Pp^^ since they have z-.p .-^{oJ p .'^) = 



D. Application 
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[24]. Thus after the leading-order terms proportional to P2{t), the set of terms proportional to P3{t) vanishes, and the next 
contribution imolves fourth-order signal variability. The second-order expansions we have emphasized are therefore particularly 
accurate for the 7 = 3 family. By contrast, it is apparent that an inopportune choice of analyzing wavelet could lead to a very 
poor estimate of the analytic signal. If one fixes Pp^^ = \fWl ^nd lets 7 increase without bound as /? decays to zero, violating 
the suitability conditions, ( fTSI l shows that \I'3.^^((jj^^) increases without bound. This implies there is no limit to how poor 
the estimated analytic signal can become if one chooses as an analyzing wavelet a function that, while analytic, is extremely 
asymmetric. 

Nevertheless, the wavelet suitability criteria give some latitude in the choice of 7. To understand why one might choose a 
particular value of 7 we consider first the roles of /3 and 7 more generally. We have noted that /3 controls the time decay, with 
'0(i)/^/'(O) ^ Meanwhile, 7 controls the high-frequency decay, as is clear from the frequency-domain form (fT4l) . 

Thus increasing 7 is attractive if one would like to extend the analysis closer to the Nyquist frequency. Decreasing 7 with 
P(i^-y held fixed, on the other hand, lets /3 and hence the rate of time decay be increased, minimizing leakage from distant 
times. Further details on the roles of /3 and 7 in setting the wavelet properties were considered by [24]. 

With the higher-order errors made small by the constraint 1 < 7 < 6, the leading-order error term is controlled by the 
choice of Pg,-,. Our analysis suggests that to minimize the leading-order error term, the wavelet should be chosen to be as 
short as possible — that is, ^ should be minimized. In the example discussed in Section [V-EI it was seen that the presence 
of noise compels us to choose large enough to stabilize the transform against random fluctuations. But a second factor 
opposing the desire to make P^ small is that a wavelet cannot be made vanishingly short and still have attractive properties 
as a bandpass filter. 

A conventional measure of the time-domain spread of a wavelet is its second moment 



Clearly the generalized Morse wavelets only have finite time spread cr^.^ for /3 > 1/2, since /3 = 1/2 implies V'(0/V'(0) ~ 
|i|-(3/2)^ is which case the integrand in the numerator Wt\ is proportional to t^^ . Such long time decay is useless in practice. 
At /? = 1, the time decay of the wavelet is already relatively slow at i^^, and so this in some sense represents a lower bound 
for useful value of /?. Then the smallest value of P/3 ,y satisfying the wavelet suitability conditions would occur at 7 = 1, 
where we have Pfj^^ = 1. This implies the wavelet executes one full cycle with its central window, an intuitive lower bound 
on the duration of a signal which is supposed to be a modulated oscillation. As a result highly variable signals with Jat^ of 
order unity will be problematic, but as mentioned earlier, such signals are not aptly described as modulated oscillations in the 
first place. 



VI. Discussion 

This work has derived fundamental properties of the continuous analytic wavelet transform (AWT). In particular we have 
calculated an exact form for the AWT of a signal which may depart substantially from the case of negligible modulation. The 
key to achieving this representation is an expansion of the signal in terms of a set of appropriate time-varying functions — the 
instantaneous modulation functions — which quantify the local degree of departure of the signal from a constant-amplitude, 
constant-frequency sinusoid. The AWT is found to involve a series of interactions of increasingly higher-order instantaneous 
modulation functions of the signal with increasingly higher-order frequency-domain derivatives of the wavelet, a result termed 
the AWT representation theorem. For signals or time intervals of a signal which are locally oscillatory, the AWT simplifies 
substantially. By constraining the magnitude of frequency-domain derivatives of the wavelet, the Taylor expansion of the AWT 
with respect to scale can be reduced to a handful of important terms in the vicinity of the instantaneous frequency curve. 

Wavelet ridge analysis, a means for estimating the properties of a modulated oscillation, was then revisited in the light of 
these results. Extending earlier work bounding the bias terms globally, we identified the lowest-order time-varying bias of the 
estimated signal properties when the amplitude and frequency modulation are not negligible. It was seen that amplitude- and 
phase-based ridge definitions are different from one another, but that this difference is fact negligible. The leading-order error is 
due to the smoothing of the analytic signal by the wavelet along the signal's instantaneous frequency curve, an object we term 
the localized analytic signal, and not to the deviation of the instantaneous frequency curve from the ridge. In fact the localized 
analytic signal controls to leading perturbation order not only the estimated of the analytic signal and its amplitude and phase, 
but also its instantaneous bandwidth; the instantaneous frequency, however, contains an additional term due to simultaneous 
amplitude and frequency modulation. All these quantities may be estimated with fidelity provided the signal modulation is not 
too strong and the wavelet is chosen appropriately. 

Given the ubiquity of modulated oscillatory signals in a number of applications, these results will enable better characteriza- 
tion of such signals, and will add to the theory underpinning existing estimation methods. For example, the discrete complex- 
valued decompositions mentioned in the Introduction have useful properties because they approximate a wavelet transform 
with an analytic mother wavelet function. Our results are applicable to most of these decompositions, up to some (small) 
corrective error term which decreases with increasing scale or length of wavelet. The Dual-Tree Complex Wavelet Transform 
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(DCWT) [1] is one such method. The DCWT has become an extremely popular tool in signal analysis, because it alleviates 
several shortcomings of the real discrete wavelet transform but maintains a controlled level of redundancy. Applications based 
on the observed properties of the DCWT coefficients are usually derived from its magnitude and phase properties [1]. By 
applying our derived understanding of the AWT we may quantify aspect of the behavior of the DCWT. Another transform 
whose higher-order properties can be approximately determined from the results derived in this article is the chirplet transform 
[44], [45]. Thus although the primary goal of this paper is to understand and improve wavelet-based estimates of oscillatory 
signals, the results should also find applicability to other local estimation methods of modulated signals. 



Appendix I 
A Freely Distributed Software Package 

All software associated with this paper is distributed as a part of a freely available Matlab toolbox called Jlab, written by 
the first author and available at http: / /www. jmlilly.net. The Jsignal module of Jlab includes numerous routines for 
high-quality wavelet ridge analysis suitable for large data sets. Given an analytic signal, instf req constructs the instanta- 
neous frequency, bandwidth, and second-order modulation function. The generalized Morse wavelets are implemented with 
morsewave, while their basic properties, peak frequency, and frequency-domain derivatives are computed in morseprops, 
morse f req, and morsederiv respectively. The time spread of the generalized Morse wavelets is computed by morsebox 
and bellpoly computes the Bell polynomials. The wavelet transform is implemented by wavetrans while ridgewalk 
has an efficient algorithm for finding the ridges. All routines are well-commented and many have built-in automated tests or 
sample figures. Finally, makefigs^ analytic generates all figures in this paper 



Appendix II 
Proof of the AWT Representation Theorem 

In this section we will use the notation ips{t) = ip{t/s)/s for a rescaled version of the wavelet. Inserting the local modulation 
expansion (|24] | into the wavelet transform dSST l. one obtains 

W^{t,s) = Wj:^{t,s)+WR^^Jt,s) (98) 

where [with pa{t) = 1] 

Tl = 

is the wavelet transform of the A^th-order time-domain polynomial from the instantaneous modulation function signal expansion 
(l24l i. Note that in (l98T l the anti-analytic contribution vanishes on account of the analyticity of the wavelet. W/?„^j(t, s) is 
implicitly defined by ( |98] | as the difference between the wavelet transform of the signal W^{t,s) and the wavelet transform 
of the expansion W^Sn ■s)- 

Now, the large-time decay of the wavelets is O (t^^^) [see (l53T l]. In order for the integrand in ( |99] l to be square integrable, 
it is clear we must have N < r^p — 2. Assuming that to be the case, (|99] l can simplify by substituting (l58l l for the nth 
dimensionless derivative ^I'„(w). Then (|99] l becomes 

W^At,^) - lx+m*isu;it))f2 ^~'^"^"'^*^ KiMt)) (100) 
2 n! 

n— 

and if we furthermore denote 

the AWT representation theorem ( l54l i follows by combining ( |98] ), ( II 001 ). and ( IIOII ). 

To obtain bounds on the residual term Wii„^^{t, s), we split the wavelet transform integration into an inner and outer 
portion. Choose an energy level a, with 1 — a << 1, which determines a wavelet half-width L^{a) as defined by ( |52l ). We 
then write 

(i, s) = (t, s;a) + Wo {t, s; a) - W^o.s„ (i, s; a) (102) 

where "I" and "O" denote integrations over the inner and outer ranges, respectively. The first of these three terms 

Wi,R^^,it,s;a) = ^x+{t) [ ' e^'^W^i?^+i(T, i) dr (103) 

gives the integral of the residual term Rn^i{t, t), defined in (|25] l, over the inner range. The second term 

Wo{t,s;a) = - (t) x+{t + t) dr + - ^* (t) x+{t + t) dr (104) 



2 
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is the integral including the entire signal over the outer range. The third term 

Wo,T.t,{t,s;a) = ^In{t,s;a) = - x+{t) 



n=0 



n=0 

+ ^ x+it) / (r) e-(*)- - [^(*)^]" ?"(^) (105) 



71=0 

is the integral of the summation over the outer range; here we have also defined the contribution from the nth term in the 
summation s;a). The reason for this seemingly circuitous route to obtaining a bound is that we have only assumed 
derivatives of x+{t) to exist on the time interval \t\ < sL^{a). Outside this interval the residual Rn+i{t, t) is no longer given 
by (IZST i, but is still defined implicitly as the difference between the time series and the summation. 
One finds the squared magnitude of the inner term is subject to the bound 

^2 

(106) 



2,1, /.m2 „ 



sup \RN+i{T,t)y 



which increases with increasing a; here is the wavelet energy 



By the triangle inequality together with the Cauchy-Schwarz inequality we also find that 

„2 



\Woit,s-af < i||x+|| 



1 - a \ c: 



s 



(107) 



(108) 



[with ||a;+|| denoting the norm of x+{t)] and thus the contribution to the wavelet transform from \t\ > sL^,{a) is negligible 
if a is chosen to be sufficiently close to unity. The contributions of these two terms are therefore antagonistic. 
To find the bound on the third term, we may note 

/ ^2.1/(2^^-1) 

L-^\a)^{-{l~a){2r^-l)f\ (109) 



and then we find 



\Pn{t)\ X 



2(r^-n)-l 



2(r^ - n) - 1 



(110) 



which follows in a few Unes of algebra from ( I105l l using the triangle inequality together with the observation and also ( BTI ) 
and ( |53] ). Here 6^ > and c?^ > are constants chosen such that 



m)\ < K\tr'^ 



where r^, gives the wavelet time decay. Thus finally 

|Wo,s„(i,s;a)|'<^|/„(t,s;a)|' 



AT 



(111) 

(112) 



(113) 



by the triangle inequality. The three components of Wij„^j(t, s) are therefore bounded, and WRj^^^{t, s) itself is bounded by 
another application of the triangle inequality. 



Appendix III 
Proof of the AWT Scale Deviation Expansion 

Noting then the normalized wavelet derivatives have the Taylor series expansion 

'^n{suj) = — *„+„i(w^) ( 1 

m— ^ ^ 

we insert this into the wavelet ridge representation theorem given in ( |54] |. yielding 



W^{t,s) =x+it) 



N 



EE- 

n— m— 



n!m! 



*m+n(wV.)Pn(t) X 



SUj{t) \ " soj{t) 



(114) 



(115) 
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where we recall that '^o{ll>^) = 1 and po{t) = 1. Now expanding powers of suj{t)/uj^ via the binomial theorem, one finds 



;{t) 



SLu{t) 



1 + 



(116) 



and inserting into ( II 15l l, we obtain the triple summation (l72b . Writing out all terms up to to = 2, n = 2 leads to 



ipi{t) 



-Mt) 



+ e^,3{t,s) (117) 



where e^.3(t, s) captures the influence of terms of higher order in the instantaneous modulation functions Pn{t), while the 
ellipses denote terms of higher order in the scale deviation Aaj(t, s) = sijj{t)/uo^ — 1. Note that we have used the facts that 
the first derivative of the wavelet at the peak frequency ^^liu)^,) vanishes by definition, and that '^'{lo^,) = 2. 



Appendix IV 
Bounding the Differentiated Residual 

For later use it will be necessary to obtain expansions for time and scale derivatives of the transform residual term e^^N+l{t^ s) 
dlOll ). which we denote by 



-{*} 



d 



As} 



OS 



(118) 



(119) 



Note that the additional factor of Lu^p in the denominator of the latter is convenient since it renders the derivative dimensionless, 
Uke the scale derivative (since s is itself dimensionless). 

Throughout this section we assume (t, s) S 'R-2:4){5nt) so that suj{t)/uj^ = l + O {^Nt)- derivative of the transform 

residual is 



d 



dt 



:X+{t)'^*{su{t)) 



but since suj{t)/uj^ — 1 + {S%^) one has 



^) = {e^^N+i{t, s)) + — 



where the order of the second term remains to be found. Likewise we find for the scale derivative 



and from ( I114l i one has 



*i(su;(t)) = 0<^*2K)x 



SU!{t) 



0{S 



Nt 



for {t, s) € 7^.2;^ ((5 jvr) ™d invoking the wavelet suitability criteria. Therefore il22\ becomes 



again leaving the order of the second term to be found. 

We can obtain bounds for the numerators in the preceding expressions as follows. Define 

oj^p ot 



d 



(120) 

(121) 

(122) 
(123) 

(124) 

(125) 
(126) 



IMPERIAL COLLEGE TECHNICAL REPORT TR-07-02 



23 



and note that C/^(t, s) and V^{t, s) may themselves be written as wavelet transforms using modified wavelets. Define 

0{t) = ~^'{t)/Lj^ (127) 
^(t) = -[yjit)+ti;\t)] (128) 

having Fourier transforms Q{lj) = ~i{uj/uj^)'9{uj) and — (^-^"^{lu) respectively. The differentiated wavelet transforms 
may then be written 

U^{t,s) = J U*(^^^x{T)dT (129) 

/OO 1 / , \ 

-if* i^^jxiT)dT (130) 

using the definition of the wavelet transform (|9]). Note that by incorporating the derivatives into the wavelets, the original signal 
remains in both integrands. 

The functions 9{t) and (p{t) are valid wavelets provided that they have finite energy and that the Fourier transform ^'((jj) of 
the original wavelet satisfies 

\uj\\^{uj)f duj < OO (131) 



OO 



duj < OO (132) 



which together constitute the admissibility conditions for 9{t) and (p{t) respectively. Now inserting the local modulation 
expansion of the signal ( l24l l into ( 11291 ) and (11301 ) we obtain [mirroring the development of Appendix [III 

s d 

— —W^it,s) = Us^{t,s) + UR^^,{t,s) (133) 
uj^ at 

s^^W^,{t,s) = V^^{t,s) + VR^^,{t,s) (134) 



where the individual terms are defined analogously to those in the transform of the original signal as in ( l98l l. In order for 
the integrals implied on the right-hand side to be well defined, we must have the truncation level N satisfy N < rg — 2 and 
N < — 2, where rg and r^p are the long-time decay of the differentiated wavelets defined as in ( |53] |. Henceforth we assume 
this to be the case. 

Since by construction, [/^n (^i ^) the right-hand side of (|133t is equal to the derivative of the summation term on the 
left-hand side, and similarly for V^jy (i, s) in (|134| i. we may also note 

s d 
uj^ dt 

■s-^^WR^^,it,s) - (136) 



= UR^^,{t,s) (135) 



ds 

The differentiated residuals dl 18b and (II 19l l thus become 

4V,(M) = 0(e„«(M)) + x^^|f|i^ ,,37, 

for the time differentiation and 

for the scale differentiation. URj^^-^{t, s) and VR„_,_,(i, s) may then be bounded in the same manner as for WRj^^^{t, s) in 
Appendix HIl but using the modified wavelets 9{t) and (p{t). 



Appendix V 
Proofs of the Forms of the Ridge Curves 

To obtain expressions for the ridge curves, it is necessary to assume at the outset the order of the deviation of a ridge from 
an instantaneous frequency curve. We assume (t, s^'^) € 'R-2;ip{<>Nt)^ i-S- that the ridge curve lies in the 2-neighborhood of the 
instantaneous frequency curve; it is found that the ridge equations do indeed have solutions within this neighborhood. Also, 
for convenience we take ^'„(cj^) to be real- valued for n < A. For the amplitude ridges, we wish to solve ( fTTHTSl ). while the 
phase ridges satisfy ( fT9H20] l. 
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Amplitude ridges will be considered first. Since ln(l + = a; — + . . . , we may write the log of the analytic wavelet 
transform as [from ( [98] l1 



In W^{t,s) = lnx+{t) + Ax^{t) + AW^{t,s) - i [Ax^{t) + AVr^(t,s)]^ + . . 



(139) 



where we note that the squared term cannot be neglected in what follows. Inserting ( l60l l and ( TTSI i for Ax^{t) and AW^,{t, s), 
respectively, one obtains for the real part 



^{\nW^{t, s)} = ^^^\nx+{t) + Ax^{t)-^[Ax4t)]^ 



Auj(t,s 
1 



[Aijit, s)Y *2(wv>) + O {5%^) + O {e^4t, s)) (140) 



with truncation level Nt = 4. 

In the 2-neighborhood (f, s) G 'R-2-4>{Snt) of ™ instantaneous frequency curve 

s|-A^(s,0 = ^=O(l) (141) 

OS LO^ 

so that taking a scale derivative of such terms transforms an 0(5^^) term into an 0(1) term. Applying the scale derivative 
to (1140b and evaluating the result along the ridge then leads to 



6 



2Lu{t) UJ^{t) 



- 1 



«'2(c^^) + O (4J + O [Sn^ X e};f{t)} = (142) 



and dividing through by \&2(w^) = O {5j^^) one obtains (f76] l for the amplitude ridges. The residual quantity in the above is 
defined as 

(143) 



(144) 



where an expression for s^]^^-^ {t, s) is given by (I138l l of Appendix II VI 

For the phase ridges, we proceed by defining the complex-valued transform quantity 

d 

H^{t, s) = n^,{t, s) - iT^,{t, s) = ~i— \nW^,{t, s) 

in analogy with the signal's complex instantaneous frequency ri{t) = w(t) — iv{t). We then differentiate the wavelet transform 
W^{t, s) as expressed in (l98T l, including terms from both Ax,i,{t) and also AW^(i, s). The orders of the various terms can be 
assessed by recalling WT\ for derivatives of the instantaneous modulation functions, and by noting that for {t, s) G 'R-2;ip{5nt) 
we have 

|Ac.(i, ^) = ^ = [1 + OiS%^)] = X 0(4J (145) 

for time derivatives of the scale deviation. With a truncation level of Nt = 3, ( 1144b becomes for {t,s) € 'R-2:-4}{Snt)^ by 



differentiating (|139b . 



d_ 

dt 



Auj{t,s)pi{t)-i]^P2{t) 



+ u;{t) X O (4J + u;{t) x O (e{*i^+i(t, s) 



(146) 



where the residual term is defined by (1137b of Appendix |IV] Writing out terms we find 



1 ^"(t) u(t) tj'(t) l v"{t) . v{t)v'{t) 

+ u;{t) X O (4J + u;{t) x O (4*iv+i(^> 



where the entire term in brackets is of second order in 5^^.. 
Rearranging the phase ridge condition iT% yields 

s^P>(i)w(t) 1 



and inserting the imaginary part of ( 1147b . one finds 



UJ,p 



1 - 



lLu"{t) U0'{t)v{t) 



$2K) + 0(4.)+0{4'3^W} 



= 1 



(147) 



(148) 



(149) 
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from which the form of the phase ridge curve dTTj i follows. Here we have defined 



,{•.*} (t s^ A 



(150) 



as the residual term along the ridge. 
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